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1.  INTRODUCTION 

One  of  the  strategic  objectives  for  Information  and 
Knowledge  Management  (IKM)  in  Canadian  Command 
and  Control  Information  Systems  (C2IS)  consists  in 
investigating  and  advancing  knowledge  creation  and 
discovery  techniques  through  which  information  is 
collected  and  processed  to  support  situation  analysis  and 
gain  sufficient  situational  awareness  to  be  able  to  project 
possible  future  courses  of  action  or  trends  with 
confidence.  In  2001,  the  Future  Army  Capabilities  report 
(DND,  2001)  pointed  out  that  “without  some  fundamental 
change,  current  army  ISR1  will  be  incapable  of  providing 
the  degree  of  knowledge  that  will  be  required  by  future 
commanders.”  Therefore  “all  relevant  data,  information 
and  knowledge  must  be  available  at  all  levels,  but 
managed  in  a  way  that  produces  a  current,  rapid  and 
coherent  understanding  of  the  battlespace,  while  at  the 
same  time  allowing  the  various  levels  of  command  to 
process  the  relevant  material  for  their  specific  purposes.” 

Initiated  in  2004  at  Defence  Research  and  Development 
Canada  (DRDC),  the  SACOT2  knowledge  engineering 
research  project  aims  at  investigating,  developing  and 
validating  innovative  natural  language  processing  (NLP) 
approaches  as  scientific  means  to  generate  rapidly  broad 
ontologies  using  large  amount  of  electronic  texts. 

Ontologies  are  key  elements  required  to  enable  decision 
support  systems,  knowledge  exploitation  and  information 
retrieval  systems  with  new  semantic  capabilities.  Since 
Gruber  (Gruber,  1993),  the  scientific  community  defines 
an  ontology  as  a  formal,  explicit  specification  of  a  shared 
conceptualization.  When  the  knowledge  of  a  domain  is 
represented  in  a  declarative  formalism  such  as  in  an 
ontology,  the  set  of  objects  that  can  be  represented  is 
called  the  universe  of  discourse.  This  set  of  objects,  and 
the  formalized  relationships  among  them,  are  reflected  in 
the  representational  vocabulary  (id.).  Domain  ontologies 
provide  vocabularies  about  the  concepts  within  a  domain 
and  their  relationships,  about  activities  that  take  place  in 
that  domain  and  about  theories  and  elementary  principles 
governing  that  domain  (Corcho  et  al ,  2003). 


1  Intelligence,  Surveillance,  Reconnaissance  (ISR) 

2  Semi-Automatic  Construction  of  Ontologies  from  Texts 
(SACOT) 


2.  NATURAL  LANGUAGE  PROCESSING  FOR 
ONTOLOGY  ENGINEERING 

Knowledge  objects  of  a  given  domain  are  expressed  and 
conveyed  in  texts  using  domain-specific  terminology.  As 
two  of  the  core  components  of  domain  ontologies  are 
concepts  and  relations  among  concepts,  the  SACOT 
project  will  focus  on  corresponding  NLP  research  areas. 
Innovative  terminology  extraction  techniques  will  be 
developed,  validated  and  used  to  automatically  build  the 
inventory  of  concepts  of  a  given  domain  from 
representative  texts.  Semantic  relations  extraction 
strategies  will  be  developed  and  then  deployed  to  identify 
automatically  semantic  relations  among  the  concepts  of  a 
domain.  Draft  ontologies  will  be  automatically  compiled 
and  generated  using  the  two  core  components  extracted 
from  the  corpus  (concepts  and  semantic  relations)  and 
other  reference  material.  Knowledge  engineers  will  use 
this  ontology  acquisition  prototype  as  a  knowledge 
framework  in  order  to  validate  and  enhance  the  draft 
ontologies.  Their  work  will  be  monitored  and  supported 
by  machine  learning  techniques  in  order  to  capture 
validation  rules  and  to  provide  the  system  with  self- 
learning  capabilities.  While  validating  the  content  of  the 
draft  ontologies,  knowledge  engineers  will  teach  the 
system  for  which  of  all  the  potential  semantic  relations 
identified  in  texts  are  most  valuable  and  which  should 
simply  be  set  aside. 


Fig.  1  The  SACOT  Knowledge  Engineering  Framework 
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3.  ONTOLOGY  ENGINEERING  METHODS 

Most  of  published  knowledge  engineering  methods 
(Corcho  et  al.9  2003)  require  interviews  with  Subject 
Matter  Experts  (SMEs).  In  ah  the  approaches  relying 
heavily  on  SMEs,  the  extent  of  the  domain  represented  in 
the  ontology  is,  to  a  certain  extent,  tailored  to  the 
expertise  and  the  degree  of  “expressiveness”  of  the  SMEs. 
This  limitation  might  lead  to  unacceptable  and  mediocre 
performance  of  ontology-based  information  systems. 
Typically,  domain  terminology  can  contain  from  few 
hundreds  (e.g.  PGA  Glossary  of  Golf)  to  several  hundreds 
of  thousands  terms  (e.g.  up  to  160,000  terms  in  a  medical 
dictionary).  It  is  unlikely  that  any  SME  interview  will 
ever  elicit  the  whole  terminology  of  a  domain.  We  need  to 
turn  to  more  exhaustive  and  objective  data  sources.  The 
main  impediments  to  classic  development  of  ontologies 
are  that  it  is  a  time  and  budget  consuming  operation  and 
that  it  is  bound  to  SMEs’  own  knowledge  limitations. 
Exhaustive  elicitation  of  knowledge  objects  of  a  domain 
requires  the  application  of  NLP  extraction  techniques 
over  textual  data. 

4.  ONTOLOGIES  FOR  INTELLIGENT  COMMAND 
AND  CONTROL  SYSTEMS 

Research  from  the  military  community  (Bourry-Brisset, 
2000;  Chance  et  al.9  2003;  Gauvin  et  al.9  2004,  Gouin  et 
al.9  2003)  clearly  indicates  that  there  are  many  needs  and 
many  potential  uses  for  ontologies  within  the  military 
domain  similar  to  industry.  With  the  development  and 
maturity  of  the  Semantic  Web  (Davies  et  al9  2003),  the 
ontology  will  be  the  cornerstone  technology,  which  shares 
a  common  understanding  of  a  domain  among  humans, 
agents  and  machines.  Ontologies  for  Command  and 
Control  Systems  will  be  instrumental  in  establishing  a 
Common  Operational  Picture  (COP)  among  units  by 
making  domain  analysis,  situation  analysis  and 
assumptions  more  explicit.  Agents  assisting  commanders 
with  the  Command  and  Control  task  will  have  the  ability 
to  “interpret”  data  and  know  its  meaning  and  value  based 
on  the  domain  ontology. 

5.  CONCLUSION 

SACOT’s  methods  and  tools  will  significantly  reduce 
time  usually  required  to  capture  the  knowledge  objects  of 
a  domain  in  traditional  ontology  building  processes.  They 
will  provide  knowledge  engineers  with  means  to  rapidly 
build  broad  domain  ontologies  for  their  applications. 
Outcomes  of  this  knowledge  engineering  framework  will 
provide  benefits  for  situational  awareness  portals,  for 


ontology-based  automatic  document  classification 
systems,  for  ontology-based  data  mining,  for  knowledge 
portals,  for  intelligent  search  engines  and  for  any  other 
application  requiring  ontologies  to  reach  semantic-level 
capabilities.  This  research  project  will  also  be  a  building 
block  in  the  development  of  generic  ontology  services 
within  the  Context  -  Ontology  -  Portfolio  knowledge 
management  portal  project  (Gauvin  et  al9  2004)  currently 
ongoing  at  DRDC  Valcartier. 
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